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O ; ABSTRACT 



We study the recently discovered gravitational lens SDSS J1004+4112, the first quasar lensed by a cluster of 
O , galaxies. It consists of four images with a maximum separation of 14f'62. The system was selected from the 

photometric data of the Sloan Digital Sky Survey (SDSS), and has been confirmed as a lensed quasar at z = 1 .734 
on the basis of deep imaging and spectroscopic follow-up observations. We present color-magnitude relations for 
, galaxies near the lens plus spectroscopy of three central cluster members, which unambiguously confirm that a 

' cluster at z = 0.68 is responsible for the large image separation. We find a wide range of lens models consistent 

with the data, and despite considerable diversity they suggest four general conclusions: (1) the brightest cluster 
CS| ■ galaxy and the center of the cluster potential well appear to be offset by several kiloparsecs; (2) the cluster mass 

^> ' distribution must be elongated in the North-South direction, which is consistent with the observed distribution of 

0^ , cluster galaxies; (3) the inference of a large tidal shear (^^0.2) suggests significant substructure in the cluster; and 

(N . (4) enormous uncertainty in the predicted time delays between the images means that measuring the delays would 

greatly improve constraints on the models. We also compute the probability of such large separation lensing in 
the SDSS quasar sample, on the basis of the Cold Dark Matter model. The lack of large separation lenses in 
previous surveys and the discovery of one in SDSS together imply a mass fluctuation normalization (Ts = I O-0 2 
. (95% confidence), if cluster dark matter halos have an inner density profile p (x r"'-^. Shallower profiles would 

require higher values of erg. Although the statistical conclusion might be somewhat dependent on the degree of 
the complexity of the lens potential, the discovery of SDSS J1004H-41 12 is consistent with the predictions of the 
abundance of cluster-scale halos in the Cold Dark Matter scenario. 
O \ Subject headings: cosmology: observation — cosmology: theory — dark matter — galaxies: clusters: general — 

gravitational lensing — quasars: general — quasars: individual (SDSS J100434.91H-41 1242.8) 

1. INTRODUCTION inally estimated by Turner, Ostriker, & Gott (1984) to be 0. 1 %- 

„ c,. ..J- f.u c 4. -4. t- 11 1 J 1%, assuming that galaxies can be modeled as singular isother- 

■ Since the discovery of the first gravitationally lensed quasar , , foTos %, ■ .• • . , . , 

■ r^n^s^,s£1 /\\t ^ u 11 e \\t /r,nr,\ u ^ on mal Spheres (SIS). This prediction has been Verified by Several 
rN Q0957H-561 (Walsh, Cai-swefl, & Weymann 1979), about 80 . \ , ,. , , ,,,,,, r, ^ . 

. 1 . u u f Aiif^ui J optical and radio lens surveys, such as the Hubble Space Tele- 

Z3 strong lens systems have been found so far. All of the lensed ^ o . o /x, . „ , . x , „ 

u • 11 n" A scope (HST) Snapshot Survey (Bahcafl et al. 1992), the Jodrefl 

■ - - quasars have image separations smaller than 7 , and they are , „ , ; » . • r, /t, o 

, . ° , . , . . , 11U . f Bank/Very Large Array Astrometnc Survey (JVAS; Patnaik et 

lensed by massive galaxies (sometimes with small boosts from , , , ^ . t » r,. r, » r,o 

i ^ , \ f , ■ ^ u u r. al. 1992), and the Cosmic Lens All Sky Survey (CLASS; Myers 

surrounding groups or clusters of galaxies). The probability , r,,, , . •• , , 

° 1 ju • . • 1 • • et al. 1995). The lensing probability is sensitive to the volume 

that distant quasars are lensed by intervening galaxies was orig- , . ■ , , , ■ 

of the universe, so it can be used to place interesting constraints 
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on the cosmological constant JIa (Turner 1990; Fukugita, Fu- 
tamase, & Kasai 1990; Kochanek 1996; Chiba & Yoshii 1999; 
Chae et al. 2002, but see Keeton 2002). 

In contrast, lenses with larger image separations should 
probe a different deflector population: massive dark matter ha- 
los that host groups and clusters of galaxies. Such lenses there- 
fore offer valuable and complementary information on struc- 
ture formation in the universe, including tests of the Cold Dark 
Matter (CDM) paradigm (Narayan & White 1988; Cen et al. 
1994; Wambsganss et al. 1995; Kochanek 1995; Flores & Pri- 
mack 1996; Nakamura & Suto 1997). So far the observed 
lack of large separation lensed quasars has been used to in- 
fer that, unlike galaxies, cluster-scale halos cannot be mod- 
eled as singular isothermal spheres (Keeton 1998; Porciani & 
Madau 2000; Kochanek & White 2001; Keeton 2001a; Sarbu, 
Rusin, & Ma 2001; Li & Ostriker 2002, 2003; Oguri 2002; 
Ma 2003). The difference can probably be ascribed to bary- 
onic processes: baryonic infall and cooling have significantly 
modified the total mass distribution in galaxies but not in clus- 
ters (Rees & Ostriker 1977; Blumenthal et al. 1986; Kochanek 
& White 2001). As a result, large separation lenses may con- 
strain the density profiles of dark matter halos of cluster more 
directly than small separation lenses (Maoz et al. 1997; Keeton 
& Madau 2001; Wyiflie, Turner, & Spergel 2001; Takahashi & 
Chiba 2001; Li & Ostriker 2002; Oguri et al. 2002; Oguri 2003; 
Huterer & Ma 2004; Kuhlen, Keeton, & Madau 2004). Alter- 
natively, large separation lensed quasars may be used to place 
limits on the abundance of massive halos if the density profiles 
are specified (Narayan & White 1988; Wambsganss et al. 1995; 
Kochanek 1995; Nakamura & Suto 1997; Mortlock & Webster 
2000; Oguri 2003; Lopes & Miller 2004). Better yet, the full 
distribution of lens image separations may provide a systematic 
diagnostic of baryonic effects from small to large scales in the 
CDM scenario. 

The fact that clusters have less concentrated mass distribu- 
tions than galaxies implies that large separation lensed quasars 
should be less abundant than small separation lensed quasars 
by one or two orders of magnitude. This explains why past 
surveys have failed to unambiguously identify large separa- 
tion lensed quasars (Kochanek, Falco, & Schild 1995; Phillips, 
Browne, & Wilkinson 2001a; Zhdanov & Surdej 2001; Ofek et 
al. 2001, 2002). For instance, CLASS found 22 small separa- 
tion lenses but no large separation lenses among ~15000 radio 
sources (PhilUps et al. 2001b). Although several large sepa- 
ration lensed quasar candidates have been found (e.g., Mort- 
lock, Webster, & Francis 1999), they are thought to be physical 
(unlensed) pairs on the basis of individual observations (e.g.. 
Green et al. 2002) or statistical arguments (Kochanek, Falco, 
& Munoz 1999; Rusin 2002). Recently Miller et al. (2004) 
found 6 candidate lens systems with image separations 9 > 30" 
among ^20000 quasars in the Two-degree Field (2dF) quasar 
sample. Given the lack of high-resolution spectra and deep 
imaging for the systems, however, it seems premature to con- 
clude that they are true lens systems. We note that because the 
expected number of lenses with such large image separations 
in the 2dF sample is much less than unity (Oguri 2003), these 
systems would present a severe challenge to standard models if 
confirmed as lenses. 

To find a first unambiguous large separation lensed quasar, 
we started a project to search for large separation lenses in the 
quasar sample of the Sloan Digital Sky Survey (SDSS; York 
et al. 2000). This project complements ongoing searches for 



small separation lenses in SDSS (e.g., Pindor et al. 2003; In- 
ada et al. 2003a). The SDSS has completed less than half of its 
planned observations, but already it contains more than 30000 
quasars and is superior to previous large separation lens surveys 
in several ways. The full SDSS sample will comprise ^100000 
quasars, so we ultimately expect to find several large separa- 
tion lensed quasars (Keeton & Madau 2001 ; Takahashi & Chiba 
2001; Li & Ostriker 2002; Kuhlen et al. 2004). One of the most 
important advantages of the SDSS in searching for large sepa- 
ration lensed quasars is that imaging in five broad optical bands 
allows us to select lens candidates quite efficiently. 

Recently we reported the discovery of the large separation 
lensed quasar SDSS J 1004+41 12 at z = 1 .73 (Inada et al. 2003b) 
in the SDSS. The quasar itself turned out to be previously iden- 
tified in flie ROSAT All Sky Survey (Cao, Wei, & Hu 1999) and 
the Two-Micron All-Sky Survey (Barkhouse & Hall 2001), but 
was not recognized as a lensed system. Inada et al. (2003b) 
showed that SDSS J1004+41 12 consists of four quasar images 
with the same redshift from the Keck spectroscopy. The col- 
ors of galaxies found by Subaru imaging follow-up observa- 
tions indicated the presence of a cluster of galaxies at z = 0.68. 
Moreover, the configuration of the four images was success- 
fully reproduced by a simple lens model based on a singular 
isothermal ellipsoid mass distribution. All these results strongly 
implied that SDSS J100444112 is the first quasar lens system 
due to a massive cluster-scale object. In this paper, we de- 
scribe photometric and spectroscopic follow-up observations 
of SDSS J1004+4112 in detail. We discuss the spectra of 
lensed quasar components, including puzzling differences be- 
tween emission lines seen in the different images. We analyze 
deep multicolor imaging data to show the existence of a lensing 
cluster more robustly. We also present detailed mass modeling 
of the lens, and discuss the implications of this system for the 
statistics of large separation lenses. 

The paper is organized as follows. Section 2 describes 
the method used to identify large separation lens candidates 
in the SDSS data. The results of follow-up observations are 
summarized in §3. In §4 we perform mass modeling of 
SDSS J1004+4112, and in §5 we consider flie statistical impU- 
cations of the discovery of SDSS J1004+4112. We summarize 
our results and conclusions in §6. Throughout this paper, we 
assume the popular "concordance" cosmology with Qm = 0.27, 
ft A = 0.73, and//o = 70 km s"' Mpc'^ (e.g., Spergel et al. 2003). 

2. CANDIDATE SELECTION FROM THE SDSS OBJECT 
CATALOG 

The SDSS is a survey to image a quarter of the Celestial 
Sphere at high Galactic latitude and to measure spectra of 
galaxies and quasars found in the imaging data (Blanton et al. 
2003). The dedicated 2.5-meter telescope at Apache Point Ob- 
servatory (APO) is equipped with a multi-CCD camera (Gunn 
et al. 1998) with five broad bands centered at 3561, 4676, 6176, 
7494, and 8873 A (Fukugita et al. 1996). The imaging data are 
automatically reduced by a photometric pipeline (Lupton et al. 
2001). The astrometric positions are accurate to about 0"1 for 
sources brighter than r = 20.5 (Pier et al. 2003). The photomet- 
ric errors are typically less than 0.03 magnitude (Hogg et al. 
2001; Smifli et al. 2002). The SDSS quasar selection algorithm 
is presented in Richards et al. (2002). The SDSS spectrographs 
are used to obtain spectra, covering 3800-9200 A at a resolu- 
tion of 1800-2100, for the quasar candidates. The public data 
releases of the SDSS are described in Stoughton et al. (2002) 
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and Abazajian et al. (2003). 

Large separation lens candidates can be identified from the 
SDSS data as follows. First, we select objects that were initially 
identified as quasars by the spectroscopic pipeline. Specifically, 
among SDSS spectroscopic targets we select all objects that 
have spectral classification of SPEC_QSO or SPEC_HI z_QSO 
with confidence z_conf larger than 0.9 (see Stoughton et al. 
2002, for details of the SDSS spectral codes). Next we check 
the colors of nearby unresolved sources to see if any of those 
sources could be an additional quasar image, restricting the lens 
search to separations 9 < 60". We define a large separation lens 
by ^ > 7" so that it exceeds the largest image separation lenses 
found so far: Q0957+561 with 9 = 6."26 (Walsh et al. 1979) 
and RX J0921+4529 with 9 = 6."97 (Muiioz et al. 2001), both 
of which are produced by galaxies in small clusters. We regard 
the stellar object as a candidate companion image if the follow- 
ing color conditions are satisfied: 

|A(7-A;)| < 3(TA(M) 

y V /quasar \ ' /stellar 

|A(;-^)|<0.1, (2) 

|Ar|<2.5, (3) 

where {j,k} = {u* ,g*}, {g* ,r*}, {r* and {i* ,z*}^'' and A 
denotes the difference between the spectroscopically identified 
quasar and the nearby stellar object. Note that this selection 
criterion is tentative; we still do not know much about large 
separation lenses, so selection criteria may evolve as we learn 
more. 
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Fig. 1. — Redshift distribution of quasars identified by tlie spectroscopic 
pipeline in the SDSS. Dashed vertical Unes show the redshift cut 0.6 < z < 2.3 
used for the statistical analysis. 

Our full sample contains 44269 quasars with the redshift dis- 
tribution shown in Figure 1 . For the lens search we select the 
subset of 29811 quasars with 0.6 < z < 2.3, making the red- 
shift cuts for four reasons: (1) at z < 0.6 quasars are often 
extended, which can complicated both lens searches and also 
lens statistics analyses; (2) at z < 0.6 the sample is contami- 
nated by narrow emission fine galaxies; (3) at z > 2.3 we may 
miss a number of quasar candidates because of large color er- 
rors; and (4) lens statistics calculations for high redshift quasars 

The starred magnitudes (u*g*r*i*z*) are used to denote still-preliminary 2.5m- 

IRAF is distributed by the National Optical Astronomy Observatories, which are 
cooperative agreement with the National Science Foundation. 



are not very reliable because of uncertainties in the quasar lu- 
minosity function (Wyithe & Loeb 2002a,b). Lens surveys of 
high-redshift quasars are of course very interesting for insights 
into the abundance and formation of distant quasars; a search 
for lenses among high-redshift SDSS quasars is the subject of a 
separate analysis by Richards et al. (2004). 

SDSS J1004-I-4112 was first selected as a lens candidate 
based on a pair of components, A and B (see Figure 2), where 
B is the SDSS spectroscopic target. Components C and D were 
identified by visual inspection and found to have colors sim- 
ilar to those of A and B (even though they do not match the 
above color criteria). Table 1 summarizes the photometry for 
the four components, and Table 2 gives the astrometry for the 
four components as well as the galaxy Gl seen in Figure 2. The 
reason that components C and D have somewhat different col- 
ors from B is still unclear, but it must be understood in order 
to discuss the completeness of the lens survey. The difference 
might be ascribed to differential absorption or extinction by in- 
tervening material (Falco et al. 1999), or to variability in the 
source on time scales smaller than the time delays between the 
images (e.g., de Vries, Becker, & White 2003), both of which 
are effects that become more important as the image separation 
grows. 

3. DATA ANALYSIS 

3.L Spectroscopic Follow-up Observations 

3.1.1. Quasar Images 

Since only component B has a spectrum from SDSS, we ob- 
tained spectra of the other components to investigate the lens- 
ing hypothesis. The first spectroscopic follow-up observations 
were done on 2003 May 2 and 5 with the Double Imaging Spec- 
trograph of the Astrophysical Research Consortium (ARC) 3.5- 
m telescope at APO. All four components have a prominent 
C IV emission fine (1549.06 A) at Aobs ~ 4230 A, indicating that 
they are quasars with very similar redshifts. Spectra with higher 
resolution and longer wavelength range were taken on 2003 
May 30 with the Low-Resolution Imaging Spectrometer (LRIS; 
Oke et al. 1995) of the Keck I telescope at the W. M. Keck Ob- 
servatory on Mauna Kea, Hawaii, USA. The blue grism is 400 
line mm"\ blazed at 3400 A, 1.09 A pixer\ covering 3000 A 
to 5000 A. The red grating is 300 fine mm"', blazed at 5000 A, 
1 .09 A pixel"' , covering 5000 A to the red limit of the detector. 
The spectra were obtained with 900 sec exposures and a 1" sUt 
in 0"9 seeing. The data were reduced in a standard way us- 
ing IRAF.^^ The Keck/LRIS spectra are shown in Figure 3. All 
four components show emission lines of Lya, Si IV, C IV, C III], 
and Mg II. They have nearly identical redshifts of z = 1 -734, 
with velocity differences less than 50kms"' (see Table 1). The 
flux ratios between the images (see Figure 3) are almost con- 
stant over the wavelength range 3000-8000 A, indicating that 
these are actual lensed images. From the spectra, we conclude 
that the color differences found in the SDSS images are mainly 
caused by differences in the emission lines (discussed below) 
and by slightly different continuum slopes. 

Several absorption line systems are seen in the spectra. Com- 
ponents A and D have intervening Mg II/Fe II absorption sys- 
tems at z = 0.676; this redshift is similar to that of the fore- 
ground galaxies (§3.1 .2), suggesting that this absorption system 

■based photometry (see Stoughton et al. 2002). 

; operated by the Association of Universities for Research in Astronomy, Inc., under 
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Fig. 3 . — Spectra (top) and flux ratios {bottom) of SDSS J1004+41 12 components A, B, C, and D taken with LRIS on Keck I. In the upper panel, we can confimi 
that all components have Lya, Si IV, C IV, C III], and Mg II emission lines at z = 1.734. The flux ratios shown in the lower panel are almost constant for a wide 
range of wavelength. Several absorption lines are also seen in the spectra (see text for details). 
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Fig. 4.— The Mg II doublet absorption lines (rest frame wavelengths of 2795.5 A and 2802.7 A, rest frame equivalent width > 0.5 A) of SDSS J1004+41 12 
components A, B, C, and D at various wavelengths. The absorption lines are indicated by vertical lines. 



is associated with the lensing galaxies. Component D has ad- 
ditional Mg II absorption systems at z = 0.726, 0.749, 1.083, 
1.226, and 1.258. Figure 4 identifies the various Mg II ab- 
sorbers. We also note that all four components have C IV ab- 
sorption lines just blueward of C IV emission lines (see Fig- 
ure 5). The velocity difference between the emission and ab- 
sorption lines is ~ 500km s"', so the absorption system is likely 
to be associated with the quasar. The fact that all four com- 
ponents have C IV absorption lines offers further evidence that 
SDSS J1004H-41 12 is indeed a gravitational lens. 
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Fig. 5.— The C IV lines of SDSS J1004+4112 components A, B, C, and 
D taken with LRIS. The associated C IV doublet absorption lines (rest frame 
wavelengths 1548.2 A and 1550.8 A, denoted by dotted lines) are seen in all 
four components. 

Figure 5 shows notable differences in the C IV emission line 
profiles in the different components. One possible explana- 
tion is the time delay between the lensed images; at any given 



observed epoch, the images represent different epochs in the 
source frame. However, the fact that the C IV emission lines 
in components A and B differ seems to rule out the time de- 
lay explanation: the expected delay (see §4.2.2) is shorter than 
the month or year time scale on which C IV emission lines typi- 
cally vary (e.g., Vanden Berk et al. 2004). Other possible expla- 
nations include differences between the viewing angles probed 
by the images, microlensing amplification of part of the quasar 
emission region, significant errors in the predicted time delay 
between A and B, or just that the quasar is extremely unusual. 
Understanding the puzzling line profile differences will require 
further observations, preferably including measurement of the 
time delays and spectroscopic monitoring to identify any vari- 
ability in the C IV lines. 



3.1.2. Galaxies 
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Fig. 6. — Spectrum of the galaxy Gl taken with LRIS on Keck I. The break, 
Ca II H&K absorption lines, and Mg absorption line are consistent with red- 
shift z = 0.680 iz = 0.6799 ± 0.0001 from the Ca II H line). The G-band also 
appears in the spectrum. 
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The spectrum of the galaxy Gl, the brightest object near the 
center of the quasar configuration (see Figure 2), was acquired 
on 2003 May 30 with LRIS. The spectrum measured from a 900 
sec exposure is shown in Figure 6. We confirm the break and 
Ca II H&K lines at Aobs ^ 6700 A. The G-band also appears in 
the spectrum. From the Ca 11 H&K and Mg lines we derive the 
redshiftofGl as z = 0.680. 
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Fig. 7. — Spectra of galaxies G2 and G3 taken with FOCAS on the Subaru 
8.2-m telescope. From the absorption lines Ca II H&K, US, and G-band, we 
find that the redshifts of both galaxies are z = 0.675 (z = 0.6751 ± 0.0001 from 
the H5 lines). 

The spectra of two additional galaxies near Gl (see §3.2) 
were taken on 2003 June 20 with the Faint Object Camera and 
Spectrograph (FOCAS; Kashikawa et al. 2002) on the Subaru 
8.2-m telescope of the National Astronomical Observatory of 
Japan on Mauna Kea, Hawaii, USA. We used the 300B grism 
together with the SY47 filter, and took optical spectra covering 
4100 A to lOOOOA with resolution 2.84 A pixel'^ The seeing 
was 0."7, and the exposure time was 1740 sec for both galax- 
ies. The data were reduced in a standard way using IRAF. The 
spectra are shown in Figure 7. Both galaxies, denoted as G2 
and G3, have z = 0.675, only 700kms~^ from the redshift of 
Gl. 

3.2. Imaging Follow-up Observations 

3.2.1. Observations 

A deep r-band image of SDSS J1004H-4112 was taken on 
2003 May 5 with the Seaver Prototype Imaging camera of the 
ARC 3.5-m telescope at APO. The image shows rich structure, 
with many galaxies between and around the quasar components 
suggesting a possible galaxy cluster in the field. For a further 
check, we obtained deeper multi-color (griz) images on 2003 
May 28 with the Subaru Prime Focus Camera (Suprime-Cam; 
Miyazaki et al. 2002) on the Subaru 8.2-m telescope. The ex- 
posure times and limiting magnitudes of the Suprime-Cam im- 
ages are given in Table 3. Suprime-Cam has a pixel scale of 
0"2pixer\ and the seeing was 0"5-0."6. The frames were 
reduced (bias-subtracted and flat-field corrected) in a standard 
way. The resulting images are shown in Figure 8. It is clear 



that there are many red galaxies around the four images. More- 
over, we find three possible lensed arclets (distorted images of 
galaxies behind the cluster), which are shown in more detail in 
Figure 9. The fact that the arclets are relatively blue compared 
with the brighter galaxies in the field (see Figure 2 in Inada et 
al. 2003b) suggests that the arclets may be images of distant 
galaxies (e.g., CoUey, Tyson, & Turner 1996). Confirming that 
they are lensed images will require higher resolution images 
and measurements of the arclets' redshifts. If the hypothesis 
is confirmed, the arclets will provide important additional con- 
straints on the lens mass distribution. 



3.2.2. Colors of Nearby Galaxies 

The colors of galaxies in flie vicinity of SDSS J1004-t4112 
can help us search for the signature of a cluster. The central 
regions of clusters are dominated by early-type galaxies (e.g.. 
Dressier 1980) that show tight correlations among their photo- 
metric properties (Bower, Lucey, & Elhs 1992). These cor- 
relations make it possible to search for clusters using color- 
magnitude and/or color-color diagrams (Dressier & Gunn 1992; 
Gladders & Yee 2000; Goto et al. 2002). 

We measure the colors of galaxies using the deep Suprime- 
Cam griz images. Object identifications are performed us- 
ing the Source Extractor algorithm (SExtractor; Bertin & 
Arnouts 1996); we identify objects with SExtractor parameter 
CLASS_STAR smaller than 0.6 in the i band image as galaxies. 
Note that this star/galaxy separation criterion is successful only 
for objects with / < 24. The magnitudes in the images are cali- 
brated using nearby stars whose magnitudes are taken from the 
SDSS photometric data. 

Since the red galaxies in clusters are dominant in the central 
regions, and the center of the cluster is thought to be near Gl, 
we divide the galaxies in the field into three categories: galaxies 
inside a 40" x 40" (corresponding to 0.2h~^Mpc x 0.2/!~'Mpc 
at z = 0.68) box centered on Gl; galaxies inside a 100" x 100" 
(0.5/i"'Mpc X 0.5/i"'Mpc) box (except for those in the first 
category); and galaxies inside a 200" x 200" (1.0/j"'Mpc x 
1.0/!~'Mpc) box (except for those in the first two categories). 
Figure 10 shows color-magnitude diagrams for the three cat- 
egories. It is clear that the color-magnitude relations, particu- 
larly r-i and i-z, show tight correlations for galaxies inside the 
40" X 40" box. Ridge lines at r-j ~ 1.1 and i-z ~ 0.5 strongly 
suggest a cluster of galaxies at z ^ 0.6 (Goto et al. 2002). The 
result is consistent with the Keck and Subaru spectroscopic re- 
sults showing that the redshifts of galaxies Gl, G2, and G3 are 
all z~ 0.68. 

We identify cluster members by their location in color-color 
space (Dressier & Gunn 1992; Goto et al. 2002). We show 
g-r-i and r-i-z color-color diagrams in Figures 11 and 12, 
respectively. We restrict the plots to galaxies brighter than ; = 24 
because of the Umitation of the star/galaxy separation. We 
make color-color cuts based on the colors expected of elliptical 
galaxies (Fukugita, Shimasaku, & Ichikawa 1995): g-r > 1.5, 
r-i > 0.7, and i-z > 0.2 for elliptical galaxies at z > 0.5. The 
galaxy distributions with and without the color cuts are shown 
in Figure 13. The galaxies that survive the color cuts are con- 
centrated around Gl, so we conclude that they are candidate 
members of a cluster of galaxies at z = 0.68 whose center is near 
Gl. We note that the distribution of candidate cluster members 
is not spherical and appears to be elongated North-South. 
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Fig. 9. — The central region of the Suprime-Cam i-band image. The galaxies with measured redshifts (Gl from LRIS and G2 and G3 from FOCAS) as well as 
the four lensed images are labeled. The possible lensed arclets are marked with rectangles. 
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Fig. 10. — Color-magnitude diagrams for the SDSS J 1004+41 12 field taken with Suprime-Cam. We divide the galaxies into three categories according to their 
positions: filled circles denote galaxies inside a 40" X 40" box centered on Gl, open triangles denote galaxies inside a 100" X 100" box, and crosses denote 
galaxies inside a 200" X 200" box. These box sizes correspond to 0.2/!"' Mpc X 0.2/7"' Mpc, 0.5/i"'Mpc X 0.5/i"'Mpc. and 1.0/r'Mpc X l.OA^'Mpc at z = 0.68, 
respectively. Thi'ee spectroscopically confirmed member galaxies are marked with open circles. The corresponding r-band absolute magnitudes at z = 0.68 (without 
K-correction) are given at the top of the frame. 
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4. MASS MODELING 
4.1. One-component Models 
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Fig. 13. — Distributions of galaxies brighter than i = 24 with (right) and without (left) the color cut. The origin (0,0) is set to the position of the central galaxy 
Gl. Filled squares denote the four lensed images. 



To search for mass models that can explain the image config- 
uration of SDSS J 1004+4 112, we use standard lens modeling 
techniques as implemented in the software of Keeton (2001b). 
The main constraints come from the image positions. We also 
use the flux ratios as constraints, although we broaden the er- 
rorbars to 20% to account for possible systematic effects due to 
source variability and time delays, micro- or milli-lensing, or 
differential extinction (See Table 4 for the full set of constraint 
data). In particular, the different colors of the images and the 
different absorption features seen in Figure 3 suggest that dif- 
ferential extinction may be a significant effect. In this section 
we do not use the position of the main galaxy as a constraint, 
because we want to understand what constraints can be placed 
on the center of the lens potential from the lens data alone. 

We first consider the simplest possible models for a 4-image 
lens: an isothermal lens galaxy with a quadrapole produced ei- 
ther by ellipticity in the galaxy or by an external shear. A spher- 
ical isothermal lens galaxy has surface mass density 



E(r) rein 
^cnt ^' 



(4) 



where rein is flie Einstein radius of the lens, and Sent = 
(c^/47rG)(Dos/OoL£'Ls) is the critical surface mass density for 
lensing, with Dql, Dqs, and Z)ls being angular diameter dis- 
tances from the observer to the lens, from the observer to the 
source, and from the lens to the source, respectively. The Ein- 
stein radius is related to the velocity dispersion a of the galaxy 
by 



= 477 



Dos 



(5) 



For an elliptical model we replace r with r[I -f((I — ^^)/(I + 
^^))cos2(^-^e)]'/-^ in the surface density, where q and 6e are 
the axis ratio and position angle of the ellipse. 

Simple models using either pure ellipticity or pure shear fail 
miserably, yielding values no better than 2 x 10"* for A'dof = 4 
degrees of freedom. This failure is not surprising: most 4- 
image lenses require both ellipticity and external shear (e.g., 
Keeton, Kochanek, & Seljak 1997), and such a situation is 
likely in SDSS J1004+4II2 since the main galaxy is observed 
to be elongated and the surrounding cluster surely contributes a 
shear. 



We therefore try models consisting of a singular isothermal 
ellipsoid (SIE) plus an external shear 7. Even though such mod- 
els are still comparatively simple, they can fit the data very well 
with a best-fit value of = 0.33 for A'dof = 2. The best-fit model 
has an Einstein radius rein = 6. "9 = 35 kpc corresponding to 
a velocity dispersion of 700 km s~', an ellipticity e = 0.50 at 
position angle 9e = 21. °4, and an external shear 7 = 0.25 at po- 
sition angle = -60.°9. Among other known lenses, such a 
large shear is found only in lenses lying in cluster environments 
(Burud et al. 1998; Barkana et al. 1999). Figure 14 shows the 
critical curves and caustics for the best-fit model. The inferred 
source position lies very close to the caustic and fairly near a 
cusp, implying that the total magnification is ^57. Figure 15 
shows the allowed ranges for the position of the deflector and 
the ellipticity and external shear in the model. The models indi- 
cate a small but significant offset of 1"6 = 7.9 /i"' kpc between 
the center of the lens potential and the main galaxy, although it 
remains to be seen whether that offset is real or an artifact of 
these still simple lens models. 

4.2. Two-component Models 

Even though the simple SIE+shear model provides a good 
fit to the data, we beheve that it is not physically plausible be- 
cause the system clearly has multiple mass components and it 
seems unlikely that all of the mass is associated with a single 
^700 km s"' isothermal component. The next level of compli- 
cation is to add a mass component representing the cluster halo. 
We still model the galaxy GI expUcitly, treating it as an isother- 
mal ellipsoid constrained by its observed position. At this point 
we do not further complicate the model by attempting to treat 
the other galaxies within the lens expUcitiy. 

4.2.1. Methods 

We model the cluster component with an NEW profile 
which has been predicted in cosmological A^-body simulations 
(Navarro, Frenk, & White 1996, 1997): 

Pcnt(z)5c(z) 



P(r)=- 



(6) 



(r/r,)(l+r/r,) 

where is a scale radius, 5c is a characteristic overdensity 
(which depends on redshift), and Pcrit(z) is the critical density of 
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the universe. Although the NFW density profile appears to de- 
viate from the results of more recent A^-body simulations in the 
innermost region (Moore et al. 1999; Ghigna et al. 2000; Jing 
& Suto 2000; Klypin et al. 2001; Fukushige & Makino 1997, 
2001, 2003; Power et al. 2003; Fukushige, Kawai, & Makino 
2004; Hayashi et al. 2004), we adopt this form for simphcity. 
The lensing properties of a spherical NFW halo are described 
by the lens potential (Bartelmann 1996; Golse & Kneib 2002; 
Meneghetti, Bartelmann, & Moscardini 2003a) 



hi^ arctanh^ \l 1 - (r/rs) 



where the lensing strength is specified by the parameter 

^s^c(z)Pcrit(z) 



Kc — 



(7) 



(8) 



Since asphericity in the cluster potential is important in mod- 
ehng this system, we generalize the spherical model by adopt- 
ing elliptical synmietry in the potential. Making the potential 
(rather than the density) elliptical makes it possible to compute 
the lensing properties of an NFW halo analytically (Golse & 
Kneib 2002; Meneghetti et al. 2003a). We may still be over- 
simplifying the mass model, because the cluster profile may 
have been modified from the NFW form by baryonic processes 



such as gas coohng (Rees & Ostriker 1977; Blumenthal et al. 
1986), and the cluster may have a complex angular structure if 
it is not relaxed (e.g., Meneghetti et al. 2003a). To allow for 
the latter possibility, we still include a tidal shear in the lens 
model that can approximate the effects of complex structure in 
the outer parts of the cluster. Overall, our goal is not to model 
all of the complexities of the lens potential, but to make the 
minimal realistic model and see what we can learn. 

Even with our simplifying assumptions, we still have a com- 
plex parameter space with 1 1 parameters defining the lens po- 
tential: the mass, ellipticity, and position angle for the galaxy 
Gl; the position, mass, scale radius, ellipticity, and position an- 
gle for the cluster; and the amplitude and position angle of the 
shear. There are also three parameters for the source (position 
and flux). With just 12 constraints (position and flux for each 
of four images), the models are under-constrained. We there- 
fore expect that there may be a range of lens models that can fit 
the data. To search the parameter space and identify the range 
of models, we follow the technique introduced by Keeton & 
Winn (2003) for many-parameter lens modeling. Specifically, 
we pick random starting points in the parameter space and then 
run an optimization routine to find a (local) minimum in the 
surface. Repeating that process numerous times should reveal 
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Fig. 16. — Allowed parameter ranges for galaxy+cluster lens models with a cluster scale radius n = 40". (a) The position of the cluster component. The fiUed 
circles mark the image positions, and the open circle marks the observed brightest cluster galaxy Gl. (b) The eUipticity and position angle of the galaxy component, 
(c) The elUpticity and position angle of the cluster component, (d) The ampUtude and position angle of the external shear. Small points show all models, while 
boxes mark models where the model galaxy is roughly aligned with the observed galaxy (fie = -19.°9 ± 20?0). 



different minima and thereby sample the full range of models. 
Many of the recovered models actually lie in local minima that 
do not represent good fits to the data, so we only keep recovered 
models with < 11.8 (which represents the 3(t limit relative 
to a perfect fit when examining two-dimensional slices of the 
allowed parameter range; see Press et al. 1992). 

We make one further cut on the models. From the previ- 
ous section, we know that an SlE-nshear lens model can give a 
good fit to the data. Thus, there are acceptable two-component 
models where most or all of the mass is in the galaxy compo- 
nent and the cluster contribution is negligible. To exclude such 
models as physically implausible, we impose an upper limit on 
the velocity dispersion of the model galaxy. Specifically, we 
only keep models with cTgai < 400 km s"\ because there are 
essentially no galaxies in the observed universe with larger ve- 
locity dispersions, even in rich clusters (e.g.. Kelson et al. 2002; 
Bernard! et al. 2003; Sheth et al. 2003). Formally, we impose 
this cut as an upper limit rein < 2."25 on the Einstein radius of 
the galaxy Gl. 

4.2.2. Results 

We first consider models where the scale radius of the clus- 
ter is fixed as = 40" (we shall justify this choice below). 
Figure 16 shows the allowed parameter ranges for acceptable 
models. First, panel (a) shows that the cluster component is 



restricted to a fairly small (but not excessively narrow) range 
of positions near the center of the image configuration. This 
is mainly a result of our upper limit on the mass of the galaxy 
component; there is a certain enclosed mass implied by the im- 
age separation, and if the galaxy cannot contain all of that mass 
then the cluster component must he within the image configu- 
ration to make up the difference. It is interesting to note that 
even in these more complicated models there still seems to be a 
small offset between the center of the cluster component and the 
brightest cluster galaxy G 1 , although the lower Umit implied by 
our models is just 0."71 = 3.6/i~^kpc. 

Figure 16b shows that the allowed values for the ellipticity 
and position angle of the galaxy Gl basically fill the parame- 
ter space, so these parameters are not constrained by the lens 
data. We might want to impose an external constraint, however. 
Analyses of other lens systems show that the lensing mass is 
typically aligned with the projected light distribution (Keeton, 
Kochanek, & Falco 1998; Kochanek 2002). We may therefore 
prefer lens models where the model galaxy is at least roughly 
aUgned with the observed galaxy, which has a position angle of 
-19.°9. To illustrate this possible selection, we show all models 
but highlight those where the position angle of the model galaxy 
is in the range 6*, = -19.°9 ± 20.°0. The broad 20° uncertainties 
prevent this constraint from being too strong. 
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Fig. 17. — Critical curves and caustics for sample galaxy+cluster lens models with a cluster scale radius = 40". In each panel, the left-hand side shows the 
critical curves in the image plane, and the right-hand side shows the caustics in the source plane on the same scale. The points in the image plane show the observed 
image positions, and the point in the source plane shows the inferred source position. The value of \x gives the total magnification in each model. 



Figure 16c shows that there are some acceptable models 
where the cluster potential is round, but most models have some 
ellipticity that is aligned roughly North-South. This is in good 
agreement with the distribution of member galaxies which is 
also aligned roughly North-South (see Figure 13). The elUp- 
ticity e 0.2-0.4 is actually quite large, considering that this 
parameter describes the ellipticity of the potential, not that of 
the density. Figure 16d shows that all of the acceptable models 
require a fairly large tidal shear 7 > 0.10, and models where 
the galaxy is aligned with the observed galaxy have a strong 
shear 7 > 0.23. The shear tends to be aligned East-West. The 
fact that the models want both a large cluster ellipticity and a 
large tidal shear strongly suggest that there is complex struc- 
ture in the cluster potential outside of the image configuration. 
It would be interesting to see whether there is any evidence for 
such structure in, for example. X-rays from the cluster. 

Figure 17 shows critical curves and caustics for sample lens 
models. The critical curves are not well determined. The outer, 
tangential critical curve can point either northeast (panel e) or 
northwest (panel d), or it can have a complex shape (panel a). 
Sometimes there is just one irmer, radial critical curve (panel 



e), but often there are two (panel c). The distance of the source 
from the caustic (and of the images from the critical curve) 
varies from model to model, so the total magnification can 
range from ~50 to several hundred or even more. Finally, per- 
haps the most interesting quahtative result is that even the im- 
age parities are not uniquely determined. In most models (e.g., 
panels a-f) images A and D lie inside the critical curve and have 
negative parity while B and C lie outside the critical curve and 
have positive parity. However, in some models (e.g., panels g- 
h) the situation is reversed. Having ambiguous image parities 
is very rare in lens modeling. 

So far we have only discussed models where the cluster has 
a scale radius = 40". We have also computed models with 
rs = (10,20,30,50,60) arcsec and we find that all of the results 
are quite similar. To understand what value of the scale ra- 
dius is reasonable, we must consider which (if any) of the mod- 
els have physically plausible cluster parameters. Even though 
NFW models are formally specified by two parameters and 
Kg, A?-body simulations reveal that the two parameters are ac- 
tually correlated. NFW models therefore appear to form a one- 
parameter family of models, although with some scatter which 



SDSS J1004+4112 



13 



reflects the scatter of the concentration parameter Cyir = r^h/r^ 
(rvir is a virial radius of the cluster). Figure 18 shows the pre- 
dicted relation between and Kj, including the scatter For 
comparison, it also shows the fitted values of in lens mod- 
els with different scale radii. Models with = 10" or 20" re- 
quire Ks much larger than expected, corresponding to a halo that 
is too concentrated. Models with > 30", by contrast, over- 
lap with the predictions and thus are physically plausible. We 
can therefore conclude very roughly that the cluster component 
must have > 30" and a total virial mass M > 10''*/!~'M(7,. 
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Fig. 18. — Relation between the cluster scale radius and lensing strength 
Ks. The solid line shows the predicted relation for clusters with the canonical 
median concentration, and the dotted lines show the Ict range due to the scatter 
in concentration (see §5.1.2). The labeled points show the value of log(M) (in 
units of /i^'Mq) at various points along the curve. The points show fitted val- 
ues of Ks for lens models with rs = (10,20,30,40,50,60) arcsec. As in Figure 
16, small points show all models, while boxes mark models where the model 
galaxy is roughly aligned with the observed galaxy. 



it would allow a good estimate of the long C-D delay and indi- 
cate whether attempting to measure that delay would be worth- 
while. Second, the enormous range of predicted time delays 
means that constraining the Hubble constant with this system 
(Refsdal 1964) will be difficult because of large systematic un- 
certainties in the lens models. Although Koopmans et al. (2003) 
recently showed that it is possible to obtain useful constraints 
on the Hubble constant even in a complex system with two mass 
components, the analysis is very complex and requires exten- 
sive data including not just the image positions and all of the 
time delays, but also an Einstein ring image and the velocity 
dispersion of one of the mass components. Even if we obtain 
such data for SDSS J1004H-4112 in the near future, it seems 
likely that it will be difficult to obtain reliable constraints on 
the Hubble constant given the complexity of the lens potential 
in SDSS J1004H-41 12. The time delays, however, would still be 
extremely useful, because they would determine the temporal 
ordering and hence the image parities, and they would provide 
constraints that can rule out many of the models that are cur- 
rently acceptable. 
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Finally, we can use the models to predict the time delays be- 
tween the images. The models always predict that the time 
delay between images C and D is the longest and the delay 
between A and B is the shortest. However, there is no ro- 
bust prediction of the temporal ordering: most models predict 
that the sequence should be C-B-A-D, but a few models pre- 
dict the reverse ordering D-A-B-C. This is a direct result of 
the ambiguity in the image parities, because the leading im- 
age is always a positive-parity image (e.g., Schneider, Ehlers, 
& Falco 1992). We note, however, that all of the models where 
the model galaxy is roughly aligned with the observed galaxy 
have the C-B-A-D ordering. 

Figure 19 shows the predictions for the long and short time 
delays. The long delay between C and D can be anything up to 
^ 3000 days, while the short delay between A and B can be 
up to ^ 37 days. For the models where the galaxy is roughly 
aligned with the observed galaxy, the two delays are approxi- 
mately proportional to each other with Atco/^tBA = 143 ± 16. 
These results have several important implications. First, the A- 
B time delay should be on the order of weeks or months, so it 
should be very feasible to measure it, provided that the source 
has detectable variations. Measuring the A-B delay would be 
very useful because it would determine the temporal ordering, 
and thereby robustly determine the image parities. In addition. 
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Fig. 19. — Predictions for the longest (Atco) and shortest (AtBA) tiine de- 
lays, where Atjj > means image leads image j, and vice versa. Results are 
shown for models where the cluster has scale length = (30,40,50,60) arc- 
sec. As in Figure 16, small points show all models, while boxes mark models 
where the model galaxy is roughly aligned with the observed galaxy. 



5. LENS STATISTICS 

In this section, we calculate the expected rate of large sep- 
aration lensing in the SDSS quasar sample. The discovery of 
SDSS J1004H-41 12 allows us to move past the upper limits ob- 
tained from previous large separation lens searches, although at 
present the main thing we can do is test whether the detection 
of one large separation lens in the current sample is consistent 
with standard theoretical models in the CDM scenario. 

5.1. Modeling Lens Probabilities 

We calculate lensing probabilities using spherical models for 
simplicity. Although halos in CDM simulations are in fact tri- 
axial (e.g., Jing & Suto 2002), the spherical assumption is of- 
ten adopted in lens statistics calculations because deviations 
from spherical symmetry mainly affect image multipHcities, not 
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image separations or the total optical depth for lensing (e.g., 
Kochanek 1996; Keeton et al. 1997; Chae 2003). While this re- 
sult has been obtained for isothermal lens potentials, checking 
it for more general halos is beyond the scope of this paper and 
is the subject of a follow-up analysis (M. Oguri et al., in prep.). 

5.1.1. Lens Probabilities 

Let the physical image position in the lens plane and physi- 
cal source position in the source plane as ^ and rj, respectively. 
Consider the probabiUty that a quasar at zs with luminosity L is 
strongly lensed. The probability of lensing with image separa- 
tion larger than 6 is given by 



P\>0;zs,L): 
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Jo 



dzhil+ZLr-r- 



f 



dZL 

dM 



(9) 



where ciiens = tt t;^ Dq^/Dq^ is the cross section for lensing, with 
rir being the physical radius of the radial caustic in the source 
plane. The lower limit of the mass integral is the mass Mmin 
that corresponds to the image separation 0; this can be com- 
puted once the density profile of the lens object is specified. 
The magnification bias B(zs,L) is (Turner 1980; Turner et al. 
1984) 



B(zs,L) = 



dr] r] ^izs,L/ fiir])) 



1 



(10) 



r;2$(zs,L)7o '^"'^V(^)' 
where ^(zs,L) is the luminosity function of source quasars. 
Note that the magnification factor /z(?7) may be interpreted as 
the total magnification or the magnification of the brighter or 
fainter image, depending on the observational selection criteria 
(Sasaki & Takahara 1993; Cen et al. 1994). In this paper, we 
adopt the magnification of the fainter image, because we con- 
centrate on the large separation lenses for which the images are 
completely deblended. 

5 . 1 .2. Generalized NFW Profile 

The lensing probability distribution at large separation re- 
flects the properties of dark halos, rather than galaxies (Keeton 
& Madau 2001; Takahashi & Chiba 2001; Li & Ostriker 2002; 
Oguri 2002). For the statistics calculation, the debate over the 
inner slope of the density profile seen in A^-body simulations 
leads us to consider the generalized version (Zhao 1996; Jing & 
Suto 2000) of the NFW density profile (eq. [6]): 

Pcrit(z)^c(z) 



P(r) = 



.3-a • 



(11) 



{r/r,)"{l + r/r,y 

While the correct value of a is still unclear, the existence of 
cusps with 1 < a < 1-5 has been established in recent A'-body 
simulations (Navarro et al. 1996, 1997; Moore et al. 1999; 
Ghigna et al. 2000; Jing & Suto 2000; Klypin et al. 2001; 
Fukushige & Makino 1997, 2001, 2003; Power et al. 2003; 
Fukushige et al. 2004; Hayashi et al. 2004). The case a = 1 
corresponds to the original NFW profile, while the case a =1.5 
resembles the profile proposed by Moore et al. (1999). The 
scale radius is related to the concentration parameter as 
Cvir = rni/rs- Then the characteristic density ddz) is given in 
terms of the concentration parameter: 



A™(z)r!(z) 4, 

Oc(Z) = ■ 



(12) 



where m(cvir) is 

OT(Cvir) = 



3-a 



2F1 (3 -a, 3 -a; 4 -a; -Cvir), 



(13) 



with 2Fi(a,b;c;x) being the hypergeometric function (e.g., 
Keeton & Madau 2001). The mean overdensity Avir(z) can be 
computed using the nonlinear spherical collapse model (e.g., 
Nakamura & Suto 1997). 

We define ^ = ^/r^ and f] = rjDoi^/rsDos- Then the lensing 
deflection angle /3(0 is related to the dark halo profile as fol- 
lows: 

/3(0 = ^ Tdz dx — . 

(14) 

The lensing strength parameter was defined in equation (8). 
For sources inside the caustic (77 < rjr), the lens equation has 
three solutions ^1 > ^2 > ^3' where image #1 is on the same 
side of the lens as the source and images #2 and #3 are on the 
opposite side.'^ The lens image separation is then 



6 = 



DoL ~ £>0L ' 



(15) 



where is a radius of the tangential critical curve (Hinshaw 
& Krauss 1987; Oguri et al. 2002). The magnification of the 
fainter image may be approximated by (Oguri et al. 2002) 



Mfamt(??) ' 



(16) 



??(l-/?'(et)) 

These approximations are sufficiently accurate over the range 
of interest here (see Oguri et al. 2002). Although we adopt a 
selection criterion that the flux ratios should be smaller than 
10:1, this condition does not affect our theoretical predictions 
because the flux ratios of strong lensing by NFW halos are typ- 
ically much smaUer than 10 : 1 (Oguri et al. 2002; Rusin 2002). 

The concentration parameter Cyir depends on a halo's mass 
and redshift. Moreover, even halos with the same mass and 
redshift show significant scatter in the concentration which re- 
flects the difference in formation epoch (Wechsler et al. 2002), 
and which is well described by a log-normal distribution. For 
the median of this distribution, we adopt the mass and redshift 
dependence reported by Bullock et al. (2001) as a canonical 
model: 

10 / M \~^" 
CBuiiock(M,z) = -— — ---- , (17) 



1+z VM,(0) 



where M*(z) is the mass collapsing at redshift z (defined by 
o"m(z) = Sc = 1.68). To study uncertainties related to the con- 
centration distribution we also consider other mass and redshift 
dependences, e.g., 



cchm(M,z) = 10.3(1+2) 



-0.3 



M 



-0.24(l+jr 



from Cooray, Hu, & Miralda-Escude (2000), and 



cjs(M,z) = 2.44. 



Avir(z) V 1+Z 



3/2 



(18) 



(19) 



3 w(Cvir) ' 

" The third image is usually predicted to be very faint, so in practice just two images are actually observed. 



from Jing & Suto (2002), with Zc being the collapse redshift of 
the halo of mass M^i^. Note that these relations were derived 
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under the assumption of a = 1 . We can extend them to a ^ 1 
by multiplying the concentration by a factor 2 — a (Keeton & 
Madau 2001; Jing & Suto 2002). 

The statistics of large separation lenses are highly sensitive 
to the degree of scatter in the concentration (Keeton & Madau 
2001; Wyithe et al. 2001; Kuhlen et al. 2004). Bullock et 
al. (2001, see also Wechsler et al. 2002) found Uc ~ 0.32 in 
their simulations. Jing (2000) found a smaller scatter (Jc ~ 
0.18 among well relaxed halos, but Jing & Suto (2002) found 
(Jc ^ 0.3 if all halos are considered. Therefore we use <Jc = 0.3 
throughout the paper. 

5.1.3. Mass Function 

For the comoving mass function of dark matter halos, un- 
less otherwise specified we adopt equation (B3) of Jenkins et 
al. (2001): 

dnjeakins . f^MPcrit(O) (ilncrj^' , -l/^,D|e^ /om 

-^=^^ ^exp(-|lna,(z)+B|), (20) 

where A = 0.301, B = 0.64, and e = 3.82. We use the approxi- 
mation of ctm given by Kitayama & Suto (1996), and the shape 
parameter presented by Sugiyama (1995). Note that this mass 
function is given in terms of the mean overdensity Ac = 180 
instead of Avir(z). Therefore, the mass function should be con- 
verted correctly (e.g., Komatsu & Seljak 2002). To study uncer- 
tainties related to the mass function we also consider two other 
possibilities: the mass function derived in the Hubble volume 
simulations, dn^yraii/dM, which is given by equation (20) with 
A = 0.22, B = 0.73, e = 3.86 in terms of the mean overdensity 
Ac = 200 / n(z) (Evrard et al. 2002); and the mass function given 
by Sheth & Tormen (1999) 



JnsTW . f^MPcrit(O) 



dM 




a5l 



, , (21) 

TT aMiz) dM V 2a2,(z); 

with A = 0.29, a = 0.66, p = 0.33 in terms of the mean overden- 
sity Ac = 180 (White 2002). 

5.2. Number ofLensed Quasars in the SDSS 

Because the lensing probability depends on the source red- 
shift and luminosity, we compute the predicted number of 
lenses in redshift and luminosity bins and then sum the bins. 
Specifically, let NizjJl) by the number of quasars in a redshift 
range - Az/2 < z < zy + Az/2 that have a magnitude in the 
range il - A/*/2 < /* < il + M* jl. Then the predicted total 
number of lensed quasars is 



MensO Q) = ^ A^(z;, O^O e;zjMil))- (22) 

We adopt bins of width Az = 0.1 and Af = 0.2. The quasar 
sample we used comprises 29811 quasars with mean redshift 
(z) = 1.45 (see Figure 1), and roughly corresponds to a sample 
with magnitude hmit i* = 19.1 (Richards et al. 2002). 

To calculate the B-band absolute luminosity L{i*) corre- 
sponding to observed magnitude /*, we must estimate the cross- 
filter K-correction KbAz)- The K-correction calculated from the 
composite quasar spectrum created from the SDSS sample by 
Vanden Berk et al. (2001) is shown in Figure 20. As a simplifi- 
cation, one might use the following approximation: 



/:k(z) = -2.5(1 -as)log(l+z)-2.5aJog (^^) "0.12, 



where the offset 0. 12 mainly arises from the difference between 
AB(4400) and B magnitudes (calculated assuming a^, = 0.5; 
Schmidt et al. 1995). Throughout the paper, however, we 
use the K-correction directly calculated from composite quasar 
spectrum. 
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Fig. 20. — The cross-filter K-correction, computed from the SDSS compos- 
ite quasar spectrum created by Vanden Berk et al. (2001). Dotted line indicate 
the approximation (eq. [23]) with as = 0.5. 

The luminosity function of quasars is needed to compute 
magnification bias. We adopt the standard double power law 
B-band luminosity function (Boyle et al. 1988) 



^izs,L)dL-- 



dL 



[L/L,(zs)]A + [L/L,(zs)]/3* L,(zs) " 



(24) 



As a fiducial model of the evolution of the break luminosity, we 
assume the form proposed by Madau et al. (1999), 



Uizs)=L,mi+zsr' 



(25) 



where a power-law spectral distribution for quasar spectrum has 
been assumed, fi, cx v'^K Wyithe & Loeb (2002b) determined 
the parameters so as to reproduce the low-redshift luminosity 
function as well as the space density of high-redshift quasars 
for a model with = 3 .43 below zs = 3, /3a = 2.58 above zs = 3, 
and /3; = 1.64. The resulting parameters are = 624Gpc~^, 
L,(0)= 1.50X 10" Lq, z* = 1.60, C = 2.65, and C = 3.30. We 
call this model LFl. To estimate the systematic effect, we also 
use another quasar luminosity function (LF2) derived by Boyle 
et al. (2000): = 3.41, /3, = 1 .58 and an evolution of the break 
luminosity L*(zs) = L,,(0)10*'^'^+*24 with ki = 1.36, k2 = -0.27, 
andM* =-21.15-l-51ogft. 

5.3. Results 

First we show the conditional probability distributions 



dP 

dZL 

dP 
d]nM 



(zL\e,zs,L) = 



(M\e,zs,L): 



d^P/dzLd9 



dP/dO 



d^P/d]nMde 



dP/dO 



(26) 



(27) 



(23) 



in order to identify the statistically reasonable ranges of redshift 
and mass for the lensing cluster. Figure 21 shows the condi- 
tional probability distribution of the lens redshift, and Figure 22 
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shows the conditional probability distribution of the lens mass, 
given that the gravitational lens system SDSS J1004+41 12 has 
image separation ^ 14", source redshift zs = 1.734, and appar- 
ent magnitude /* = 18.86. We find the most probable lens red- 
shift to be zl ~ 0.5, but the distribution is broad and the mea- 
sured redshift zl = 0.68 is fully consistent with the distribution. 
We also find a cluster mass M ^ 2-3 x 10^* Mq to be most 
probable for this system. This result is in good agreement with 
the mass estimated from the lens models (see Figure 18). Note 
that we do not include information on the measured redshift 
Zl = 0.68 in Figure 22, which might cause a slight underesti- 
mate of the lens mass. 
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the previous large separation lens surveys, we adopt the CLASS 
6" <9 < 15" survey comprising a statistically complete sample 
of 9284 flat-spectrum radio sources (Phillips et al. 2001b). For 
the CLASS sample, we use a source redshift zs = 1-3 (Marlow 
et al. 2000) and a flux distribution NiS)dS oc S'^-^dS (Phillips et 
al. 2001b) to compute the expected number of large separation 
lenses. 




Fig. 23. — Contours of the predicted number of large separation (8 > l") 
lenses in the current SDSS sample in the {a. erg) plane. 
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Fig. 21. — Conditional probability distributions for the lens redshift in 
SDSS J1004+4112, given the image separation ~ 14", source redshift 7s = 
1.739, and apparent magnitude i* = 18.86. Solid and dashed lines show the 
probability distributions with a = 1.5 and 1.0, respectively. The arrow shows 
the measured redshift of the lensing cluster We assume erg = 1 .0. 
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Fig. 22. — Same as Figure 21. but the conditional probability distribution of 
the mass of the lens is plotted. 

Next we consider the statistical implications of 
SDSS J1004-I-41 12. Although our large separation lens search 
is still preliminary, and we have several other candidates from 
the current SDSS sample that still need follow-up observations, 
we can say that the current sample contains at least one large 
separation lens system. This is enough for useful constraints 
because of the complementary constraints available from the 
lack of large separation lenses in previous lens surveys. Among 
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Fig. 24. — Constraints from both SDSS and CLASS in the (OjCrg) plane. 
The discovery of one large separation {d > 7") lens in SDSS provides lower 
limits on a and erg, while the lack of large separation lenses (6" <6 < 15") in 
CLASS yields the upper limit. The regions in which both SDSS and CLASS 
limits are satisfied are shown by the shadings. The confidence levels are 
68.3%, 95.5%, and 99.7% in the dark, medium, and light shaded regions, re- 
spectively. 



Figure 23 shows contours of the predicted number of large 
separation lenses with 9 > 7" in the SDSS quasar sample. Since 
the number of lenses is very sensitive to both the inner slope of 
the density profile a and the mass fluctuation normalization erg 
(e.g., Oguri 2002), we draw contours in the (a, erg) plane. Con- 
straints from the existence of SDSS J1004H-41 12 together with 
the lack of large separation lenses in the CLASS sample are 
shown in Figure 24. To explain both observations, we need a 
relatively large a or erg, such as ag = 1.0^0 2 (95% confidence) 
for a = 1.5. This value is consistent with other observations 
such as cosmic microwave background anisotropics (Spergel et 
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al. 2003). By contrast, if we adopt a=l then the required value 
of (Tg is quite large, as > 1.1. Thus, our result might be in- 
terpreted as implying that dark matter halos have cusps steeper 
than a = I. Alternatives to coUisionless CDM, such as self- 
interacting dark matter (Spergel & Steinhardt 2000) or warm 
dark matter (Bode, Ostriker, & Turok 2001), tend to produce 
less concentrated mass distributions which are effectively ex- 
pressed by low a; such models would fail to explain the discov- 
ery of SDSS J1004+41 12 unless as is unexpectedly large. This 
result is consistent with results from strong lensing of galaxies 
by clusters (i.e., giant arcs), which also favors the colhsion- 
less CDM model (Smith et al. 2001; Meneghetti et al. 2001; 
Miralda-Escude 2002; Gavazzi et al. 2003; Oguri, Lee, & Suto 
2003; Wambsganss, Bode, & Ostriker 2004; Dalai, Holder, & 
Hennawi 2004, but see Sand et al. 2004 for different conclu- 
sion). We note that the abundance of large separation lenses 
produces a degeneracy between a and erg seen in Figure 24, but 
additional statistics such as the distribution of time delays can 
break the degeneracy (Oguri et al. 2002). 

Table 5 summarizes the sensitivity of our predictions to vari- 
ous model parameters. The uncertainties in our predictions are 
no more than a factor of 2-3, dominated by uncertainties in the 
concentration parameter and the matter density Hm- This error 
roughly corresponds to Aag ~ 0. 1, and so does not significantly 
change our main results. There may be larger systematic errors 
associated with effects we have not considered in this paper. 
For example, triaxiality in cluster halos is very important in arc 
statistics because it can dramatically increase the length of the 
tangential caustic that gives rise to giant arcs (Oguri et al. 2003; 
Meneghetti et al. 2003a; Dalai et al. 2004). The effect would 
seem to be less important in the statistics of lensed quasars, 
which depend mainly on the area enclosed by the radial caus- 
tic, but it still needs to be examined quantitatively. The pres- 
ence of a central galaxy is thought to have a small impact on 
arc statistics (Meneghetti, Bartelmann, & Moscardini 2003b), 
but the complexity of the lens potentials we found in our mass 
modeling suggests that the effect of the galaxy on the statistics 
of lensed quasars also needs to be considered. 

6. SUMMARY AND CONCLUSION 

We have presented the candidate selection and follow- 
up observations of the first cluster-scale lensed quasar, 
SDSS J1004+4112. The system consists of four components 
with image separation ~ 14", and was selected from the large 
separation lens search in the SDSS. The spectroscopic and pho- 
tometric follow-up observations confirm SDSS J 1004+41 12 
to be a lens system; spectroscopic observations of four com- 
ponents showed that they have nearly identical spectra with 
z = 1 .734. Deep images and spectroscopy of nearby galaxies 
indicate that there is a cluster of galaxies with z = 0.68, whose 
center is likely to be among the four components. We conclude 
that the cluster is responsible for this large separation lens. Dif- 
ferences between the C IV emission line profiles in the four im- 
ages remain puzzhng, and it will be interesting to reobserve the 
profiles at several epochs to search for variabiUty that might 
explain the differences. 

We have shown that reasonable mass models can success- 
fully reproduce the observed properties of the lens. When we 
consider models that include both the cluster potential and the 
brightest cluster galaxy, we find a broad range of acceptable 
models. Despite the diversity in the models, we find several 
general and interesting conclusions. First, there appears to be a 



small (^4/z"' kpc) offset between the brightest cluster galaxy 
and the center of the cluster potential. Such an offset is fairly 
common in clusters (e.g., Postman & Lauer 1995). Second, 
the cluster potential is inferred to be elongated roughly North- 
South, which is consistent with the observed distribution of ap- 
parent member galaxies. Third, we found that a significant ex- 
ternal shear 7 0.2 is needed to fit the data, even when we al- 
low the cluster potential to be elliptical. This may imply that the 
structure of the cluster potential outside of the images is more 
complicated than simple elliptical symmetry. Fourth, given the 
broad range of acceptable models, we cannot determine even 
the parities and temporal ordering of the images, much less the 
amplitudes of the time delays between the images. Measure- 
ments of any of the time delays would therefore provide power- 
ful new constraints on the models. We note that the complexity 
of the lens potential means that the time delays will be more 
useful for constraining the mass model than for trying to mea- 
sure the Hubble constant. 

Our modeling results suggest that further progress will re- 
quire new data (rather than refinements of current data). The 
interesting possibilities include catalogs of confirmed cluster 
members. X-ray observations, and weak lensing maps, not 
to mention measurement of time delays and confirmation of 
lensed arcs (either the possible arclets we have identified, 
or others). For instance, with an estimated cluster mass of 
M ~ 3 X IO^'^H'^Mq, the estimated X-ray bolometric flux is 
Sx ~ 10~'^ergs~'cm"^, which means that the cluster should 
be accessible with the Chandra and XMM-Newton X-ray ob- 
servatories; the excellent spatial of Chandra may be particu- 
larly useful for separating the diffuse cluster component from 
the bright quasar images (which have a total X-ray flux Sx ~ 
2 X 10"'2ergs"^cm"2 in the RO SAT AW Sky Survey). The confir- 
mation of lensed arclets would be very valuable, as they would 
provide many more pixels' worth of constraints on the com- 
plicated lens potential. In principle, mapping radio jets in the 
quasar images could unambiguously reveal the image parities 
(e.g., Gorenstein et al. 1988; Garrett et al. 1994), but unfortu- 
nately the quasar appears to be radio quiet as it is not detected 
in radio sky surveys such as the FIRST survey (Becker, White, 
& Helfand 1995). 

Although the large separation lens search in the SDSS is still 
underway, we can already constrain model parameters from the 
discovery of SDSS J1004+4112. The existence of at least one 
large separation lens in SDSS places a lower limit on the lens- 
ing probability that complements the upper limits from previ- 
ous surveys. Both results can be explained if clusters have the 
density profiles predicted in the colhsionless CDM scenario and 
moderate values of the mass fluctuation parameter erg. In partic- 
ular we find as = l-0^0 2 (95% confidence) assuming the inner 
density profile of dark matter halos has the form p oc r"" with 
a= 1.5. The value of as is, however, degenerate with a such 
that smaller values of a require larger values of as- Various 
systematic errors are estimated to be Atrg ~ 0.1, dominated by 
uncertainties in the distribution of the concentration parameter 
Cvir and in the matter density parameter fi^- Other systematic 
effects, such as triaxiality in the cluster potential and the pres- 
ence of a central galaxy, remain to be considered. Still, our 
overall conclusion is that the discovery of SDSS J1004+41 12 is 
fully consistent with the standard model of structure formation 
(i.e., CDM wifli erg ~ 1). 

In summary, SDSS J1004+4112 is a fascinating new lens 
system that illustrates how large separation lenses can be used 
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to probe the properties of clusters and test models of struc- 
ture formation. The full SDSS sample is expected to contain 
several more large separation lenses. The complete sample of 
lenses, and the distribution of their image separations, will be 
extremely useful for understanding the assembly of structures 
from galaxies to clusters. More immediately, the discovery of 
a quasar lensed by a cluster of galaxies fulfills long-established 
theoretical predictions and resolves uncertainties left by previ- 
ously unsuccessful searches. 
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Table 1 

Photome try OF SDSS J 1004+41 12 



Objccl 


i' 


ir -g ' 








Rcdshifl 


A 


18.46 ±0.02 


0.15 ±0.05 


-0.03 ±0.04 


0.24 ±0.03 


0.02 ±0.05 


1.7339 ±0.0001 


B 


18.86 ±0.06 


0.18±0.08 


-0.05 ±0.08 


0.23 ±0.08 


-0.03 ±0.09 


1.7335 ±0.0001 


C 


19.36 ±0.03 


0.03 ±0.05 


-0.03 ±0.04 


0.38 ±0.04 


0.05 ±0.08 


1.7341 ±0.0002 


D 


20.05 ±0.04 


0.15 ±0.09 


0.15 ±0.05 


0.46 ±0.05 


0.09 ±0.13 


1.7334 ±0.0003 



Note. — Magnitudes and colors for the four quasar images, taken from the SDSS photometric data. 
Redshifts are derived from Lya lines in the Keck LRIS spectra (see Figure 3). 



Table 2 

ASTROMETRYOF SDSS J1004+4112 



Object 


R.A.(J2000) 


Dec.(J2000) 


AR.A.[arcsec]'* 


ADec.[arcsec]^ 


A 


10 04 34.794 


+41 12 39.29 


0.000 ±0.012 


0.000 ±0.012 


B 


10 04 34.910 


+41 12 42.79 


1.301±0.011 


3.500±0.011 


C 


10 04 33.823 


+41 12 34.82 


-10.961 ±0.012 


-4.466 ±0.012 


D 


10 04 34.056 


+41 12 48.95 


-8.329 ±0.007 


9.668 ±0.007 


Gl 


10 04 34.170 


+41 12 43.66 


-7.047 ±0.053 


4.374 ±0.053 



Note. — Astrometry from the deep imaging data taken with Suprime-Cam (see 
§3.2). The absolute coordinates are cahbrated using the SDSS data. 

"Positions relative to component A. 



Table 3 
Subaru observations 



Band 


Exptime 




g 


810 


27.0 


r 


1210 


26.9 


i 


1340 


26.2 


z 


180 


24.0 


Note. 


— Total 


expo- 



sure time in seconds (ex- 
ptime) and limiting magni- 
tude (fMiim) for the Subaru 
deep imaging observations. 

^Defined by S/N> 5 for 
point sources. 



SDSS J1004+4112 



Table 4 



Constraints on mass models 



Object 


x[arcsec]^ 


y[arcsec]'* 


Flux[arbilraryj'' 


PALdeg]'^ 


A 


0.000 ±0.012 


0.000 ±0.012 


1.0 ±0.2 




B 


-1.301 ±0.011 


3.500 ±0.011 


0.682 ±0.136 




C 


10.961 ±0.012 


-4.466 ±0.012 


0.416 ±0.083 




D 


8.329 ±0.007 


9.668 ±0.007 


0.195 ±0.039 




Gl 


7.047 ±0.053 


4.374 ±0.053 




-19.9 



Note. — Summary of positions, flux ratios, and position angles (PA) of 
SDSS J1004+41 12 used in the mass modeling. 

^The positive directions of x and y are defined by West and North, respec- 
tively. 

''Errors are broadened to 20% to account for possible systematic effects. 
'^Degrees measured East of North. 



Table 5 
Systematic effects 



Models 




Mens(>7")for(a,as) 




(1.0,0.7) 


(1.5,0.7) 


(1.0,1.1) 


(1.5,1.1) 


fiducial model 


0.0027 


0.071 


0.47 


3.0 


CBuUock CCHM 


0.00002 


0.0097 


0.16 


1.8 


CBuUock — > CJS 


0.00049 


0.065 


0.17 


2.4 


dn]enkim/dM -> dnE„^id/dM 


0.0044 


0.11 


0.53 


3.2 


dnjenkinJdM ^ dnsj^ff/dM 


0.00075 


0.026 


0.22 


1.7 


LF1^LF2 


0.0024 


0.066 


0.42 


2.8 


= 0.27 0.22 


0.00066 


0.027 


0.24 


1.8 


Om = 0.27 ^ 0.32 


0.0083 


0.15 


0.81 


4.6 



Note. — Sensitivity of the predicted number of large separation lensed quasars 
in the SDSS quasar sample to various changes in the statistics calculations. 



